ALGEBRAIC METHODS IN DISCRETE ANALOGS OF THE 
KAKEYA PROBLEM 



LARRY GUTH AND NETS HAWK KATZ 



Abstract. We prove the joints conjecture, showing that for any TV lines in 
R 3 , there are at most 0(iV5) points at which 3 lines intersect non-coplanarly. 
We also prove a conjecture of Bourgain showing that given TV 2 lines in R 3 so 
that no TV lines lie in the same plane and so that each line intersects a set P of 
points in at least TV points then the cardinality of the set of points is f2(TV 3 ). 
Both our proofs are adaptations of Dvir's argument for the finite field Kakeya 
problem. 



1. Introduction 

Various authors have considered the joints problem. It asks, given TV lines in 

space, how many "joints" can the lines form, where a joint is defined as a point 

where three lines with linearly independent directions intersect. Obviously given 

/ — / — I — 3 
a vf x vf x \/f cube in the integer lattice, we get TV lines with J^L joints 

by simply taking all lines in coordinate directions which intersect the cube and the 
lattice. The joints problem is to prove: 

Theorem 1.1. Any set of N lines in ffi 3 form at most 0(N%) joints. 

The previous best bound in the joints problem is due to Feldman and Sharir [6], 
who proved that the number of joints is 0(tV 16232 ). Earlier bounds were reported 
in [12], [10], and [2]. Bennett, Carbery, and Tao obtained a result conditioned on 
the angles at the joints in [1]. 

At AIM in 2004, Bourgain conjectured the following: 

Theorem 1.2. Let L be a set of N 2 lines in R 3 and let P be a set of points in R 3 . 
Suppose no more than N lines of L lie in the same plane and suppose the each line 
of L contains at least N points of P. Then \P\ = f2(TV 3 ). 



The previous best bound in Bourgain's incidence problem is due to Solymosi and 
Toth [11], who proved that the number of points is f2(TV n / 4 ). 

What both conjectures have in common is that they are discrete models of the 
Kakeya problem. Work of Sharir on joints helped inspire Schlag's program on 
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Kakeya, sec [9]. Bourgain's conjecture was posed with the analogy to Kakeya in 
mind. 

In 2008, Dvir ([5]) solved the finite field version of the Kakeya problem. His tech- 
nique was to study the properties of a polynomial which vanishes on the Kakeya 
set. We adapt this technique, proving the above theorems through a study of a 
polynomial which vanishes on the point sets in question. 

The main idea of both proofs is as follows. We find a polynomial of as low degree as 
possible which vanishes on the set of joints (resp. points) in question. We factorize 
the polynomial to irrcduciblcs and find an irreducible factor vanishing on many 
joints (points). That irreducible factor will also vanish on many lines. In the event 
that the gradient, too, vanishes on many lines, a variant of Bezout's theorem leads 
to a contradiction by reducing our irreducible. Otherwise at each point where such 
lines intersect, all the lines lie in the tangent plane. In the joints problem this leads 
to an immediate contradiction, since intersections must be non-coplanar. In the 
Bourgain problem, this leads to many flat points of the zero set of the irreducible, 
which force that zero set to be a plane. This contradicts the hypothesis of fewer 
than N lines in a plane. The idea of planincss, that in Kakeya problems, lines at 
a given point of intersection lie mainly in a plane, seems first to have arisen in the 
work of the second author with Laba and Tao [8]. The idea that this plane is the 
tangent space to a polynomial vanishing on the set comes from the work of the first 
author on the endpoint multilinear Kakeya problem [7] . 

We have tried to minimize the amount of algebra background needed for the paper. 
The small amount of algebra we use is summarized in the next section with refer- 
ences. The main ingredient is Bezout's theorem. In order to minimize the algebra, 
we focus on irreducible polynomials, and we use pigeonhole estimates to locate an 
irreducible polynomial that vanishes on many joints or points. It is also possible to 
give a proof using reducible polynomials. Such a proof would need less pigeonhole 
estimates, but it would require more algebra. 

Acknowledgements: The second author would like to thank the University of 
Toronto and the Fields Institute for their hospitality which allowed this collabora- 
tion to take place. He would also like to thank his colleague Michael Larsen for a 
useful conversation about resultants. He is partially supported by NSF grant DMS 
0653763. 

2. Algebraic Preliminaries 

In this section, we bring together various algebraic facts that we shall need. Good 
references are the books of Cox, Little, and O'Shea. ([3], [4]) 

We recall a fundamental object, the resultant of two polynomials. Given / and g 
elements of C[x] having degree / and m respectively, and given as 



f(x) = aix 1 + ai-ix l 1 H ha . 
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and 

g(x) = b m x m + b m ^x m - x + • • • + 6 , 
we define the resultant of / and g, namely Res(f,g) as the determinant of the 
I + to x / + to matrix whose coefficients Cjj satisfy Cjj = dj-i if 1 < z < to and 
i < j < i + I, satisfy = bj- i+m ifm+l<i<m + l and i — to < j < i — to + I 
and are equal to zero otherwise. 

The columns of the matrix Cjj represent coefficients of the polynomial / multiplied 
by where j runs from to to — 1 and the coefficents of the polynomial g mul- 
tiplied by x k where k runs from to I — 1. The resultant tests whether this set of 
polynomials is linearly independent. Linear independence fails exactly when / and 
g have a common factor. (The resultant was first defined in this way by Sylvester.) 
We bring together some basic properties of the resultant following [4] . 

Now suppose instead that we work with polynomials /,jgC[ii,..., x n \. We may 
view them as polynomials in x\ with coefficients that are polynomials in x 2 , . . . ,x n . 
Then we denote the resultant, a polynomial in x 2 , ■ ■ ■ , x n as Res(f, g;x\). In fact, 
computing resultants is all we need to do in order to determine whether polynomials 
in several variables have a common factor. 

Theorem 2.1. Let f, g E C[xi, . . . , x n ] and suppose that both f and g have positive 
degree when viewed as polynomials in x\ then f and g have a common factor if and 
only if Res(f,g;xi) is identically zero. 

Theorem 2.1 is §3.6 proposition 1 (ii) in [4]. 

Proposition 2.2. Let f and g be elements of C[x±, x 2 ] and suppose that f and g 
have degrees I and m respectively. Furthermore, assume that f has degree I in X\ 
and g has degree m in x\. Then Res(f,g;x\) is a polynomial of x 2 of degree at 
most Im. 



Proof Given two polynomials of one variable, 

f(x) = (x - n)(x -r 2 )...(x- n), 



and 

we have that 



g{x) = (x- si)(x -s 2 )...(x- s m ), 



Res{f,g) = \\ \\r 3 - s k . 

3=1 k=i 

The coefficient of / is a symmetric polynomial in the roots of / which is ho- 
mogeneous of degree I — i. Similarly, hi is a homogenenous polynomial of degree 
to — i in the roots of g. We therefore assign the variable ai a degree I — i and the 
variable bi the degree ra — i. With respect to these degrees, the resultant Res(f,g) 
is a homogeneous polynomial of degree Im. On the other hand, the coefficient aj is 
a polynomial in x 2 of degree at most I — i and the coefficient bi is a polynomial in 
x 2 of degree at most m — i. Therefore, Res(f, g; x\) is a polynomial of x 2 of degree 
at most Im. ■ 
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Proposition 2.2 is the main point in the proof of the celebrated Bezout theorem. 

Corollary 2.3. (Bezout's theorem) Let f and g be elements o/C[a;i,a;2] and sup- 
pose that f and g have positive degrees I and m respectively. Suppose there are more 
than Im points of C 2 where f and g both vanish. Then f and g have a common 
factor. 

Proof We begin by changing coordinates. We may choose x\ and x 2 so that / 
has degree I in x\ and g has degree rn in X\ . We may also guarantee that there are 
more than Im distinct values of X2 among the points where / and g both vanish. A 
generic choice of coordinates accomplishes these goals. By Proposition 2.2, we see 
that Res(f,g;xi) is a polynomial of X2 with degree at most Im. Since it vanishes 
at more than Im points, it must vanish identically An application of Theorem 2.1 
completes the proof. ■ 

We shall need a small generalization of corollary 2.3 that works in C 3 when two 
polynomial vanish simultaneously on many lines. 

Corollary 2.4. Let f and g be elements of C[xi, x%, Xs] and suppose that f and 
g have positive degrees I and m respectively. Suppose there are more than Im lines 
on which f and g simultaneously vanish identically. Then f and g have a common 
factor. 

Proof Without loss of generality, we may choose X\ so that / and g have positive 
degree in X\ and x% so that the x% — plane is transverse to at least lm+1 of the 
lines of vanishing. Then fixing x% and apply Bezout's theorem and theorem 2.1, we 
get that Res(f,g;xi) vanishes identically as a function of x 2 . Since this happens 
for all values of 23, we have that Res(f,g;xi) vanishes identically and therefore 
applying theorem 2.1, we get the desired result. ■ 

Finally we prove the real analog of corollary 2.4. The result below is the one we 
apply in the proof of our theorems. 

Corollary 2.5. Let f and g be elements of M\xi, x 2 , X3], and suppose that f and 
g have positive degrees I and m respectively. Suppose that there are more than Im 
lines on which f and g simultaneously vanish identically. Then f and g have a 
common factor. 

Proof We can think of / and g as elements of C[xi,x 2 ,x 3 ], and they must vanish on 
more than Im complex lines in C 3 . By corollary 2.4, / and g must have a common 
factor h in C[a;i, x%, X3]. We can assume h is irreducible. A priori, the polynomial 
h may or may not be real. But, if h is non-real, then the irreducible factorization 
of / must contain both h and h. Hence / is divisible by the real polynomial hh. 
Similarly, g is divisible by hh. ■ 

We take this moment to state an additional algebraic proposition which we will use 
in what follows. 
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Proposition 2.6. Let X be a set of N points in R 3 . Then there is a nontrivial 
polynomial in R[xi, X2, £3] (a fortiori in C[xi,X2,xs]) which vanishes on every point 
of X of degree less than CN$ , with C a universal constant. 

Proof A polynomial of three variables and degree d has ( d + 3 )( rf + 2 )( d + 1 ) coefficients. 
Requiring that a polynomial vanish at a point gives a homogeneous linear equation 
for the coefficients. Underdetermined systems of homogeneous linear equations 
always have nontrivial solutions. ■ 



3. Geometric Preliminaries 



In this section, we will recall some basic facts of the extrinsic geometry of irreducible 
algebraic varieties in M 3 . 

We let p be a nontrivial irreducible polynomial on R 3 of degree d > 0. We consider 

S = {(x,y,z) : p(x,y,z) = 0}. 

We say that a point a E S is critical if Vp(a) = 0. Otherwise, we say that a is 
regular. (By the implicit function theorem, S is locally a manifold in a neighborhood 
of a regular point.) We say a line I is critical if I C S and every point of I is critical. 

Proposition 3.1. The set S contains no more than d{d — 1) critical lines. 

Proof Suppose not. We apply corollary 2.5 to p and a nontrivial component of 
\7p. This contradicts the irreducibility of p. ■ 

Next, we begin to investigate regular points of S. We immediately get the following. 

Proposition 3.2. Let a be a regular point of S. Let I be a line contained in S 
which passes through a. Then I C T a S, where T a S is the tangent plane to S at a. 

Let a be a regular point of S. We would like to investigate the extrinsic curvature 
of S at a. That is, we want to understand infinitcsimally how the direction of Vp 
is changing in a neighborhood of a in S. We define 

U(p)(a) = {V V pxe 3 Vp X Vf)}j=l,2,3, 

where x is the cross product of vectors and ei, e2, are the standard basis vectors 
in R 3 . Clearly II(p) is a set of three vectors. Thus it has nine components. Each 
of the components is a polynomial of degree no more than 3d — 4. 

We will refer to II(p) as the algebraic second fundamental form of S. (The geometric 
second fundamental form is a quadratic form on T p S obtained by differentiating 
the unit normal vector to S along S. However since the algebraic fundamental 
form measures the normal component of the change of Vp along three directions 
which span T p S, it is easy to see that for any regular point a, all the components 
of II(p)(a) vanish if and only if the second fundamental form of £ vanishes. 
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We say a regular point a of S is flat if all the components of ll(p)(a) vanish. We 
give a sufficient condition for a regular point a to be flat. 

Lemma 3.3. Let a be a regular point of S. Suppose that S contains three distinct 
lines all of which intersect at a, then a is a flat point. 



Proof The quadratic surface which most closely approximates S at a contains 
the three lines. (This is because the Taylor series of p along the lines vanishes.) 
However so does T a S. Thus since 3 > 2 by corollary 2.5 the quadratic surface most 
closely approximating S must contain T a S. Therefore, the second fundamental 
form of S vanishes at a. ■ 



We say that a line I is fiat if it is contained in S, it is not critical, and every regular 
point of I is flat. 

Corollary 3.4. Let p be an irreducible polynomial of degree d > 0. Let S be the 

zero set of p. Suppose that S contains more than 'id 2 — Ad flat lines, then S is a 
plane. 



Proof By corollary 2.5, each component of II(p) has p as a factor. Therefore the 
direction of the normal to S at regular points of S does not vary. Therefore S 
contains a plane. But p is irreducible so S is a plane. ■ 



4. Analytic preliminaries 

In the following two sections we will prove "big oh" results by contradiction. Thus 
we will make an assumption involving a large constant K. We keep track of anything 
that depends on K. However, we ignore constants which are independent of K. 
Thus we write 

A<B 

if A and B are quantities and C is a universal constant. 

There are two variants of the pigeonhole principle which we will use frequently. The 
first is often referred to as the popularity lemma. 

Lemma 4.1. Let (X, Y, E) be a bipartite graph with E the edges and X and Y the 
two sets of vertices. Suppose that \E\ > p\Y\. Let Y' be the set of vertices of Y 
having degree at least p and let E' be the set of edges in E between Y' and X Then 

\E'\>{p-n)\Y\. 

Proof The vertices in Y\Y' are incident to at most fj,\Y\ edges. ■ 



We now give the second which we'll refer to freely as the pigeonhole principle. 
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Lemma 4.2. Let xi, . . . , x m be positive quantities and yi, . . . , y m positive quanti- 
ties, then there is an integer 1 < k < m so that 

Xk_ YsjLl X 3 
— \-^m 

Vk 2^,-=i Vj 



J2 m - x ■ 

Proof Suppose not. Let p = ^r 1 - 3 . Then Xk < pyt for all k. Thus 

m m 

Y^xj < pj^yj, 

which is a contradiction. 



5. Joints Problem 



In this section, we prove theorem 1.1. 

We suppose that we are given a set of lines L of cardinality N. Let J be the set of 
joints determined by L. We suppose 

|J| > KNi, 

with K a large, but universal, constant. 

We create a bipartite, three-colored graph (L, J, R, G, B) between the set of lines 
and the set of joints. Each joint is incident to at least three noncoplanar lines. For 
each joint, we arbitrarily color one of the incidences red, one green, and one blue. 
The sets R, G, and B consist of, respectively, the red, green, and blue incidences. 

We will now refine the sets slightly. We let Lr be the set of all lines which have at 
least j§)qN2 incidences in R. We let Jr be the set of joints having a red incidence 
with a line of Lr. By lemma 4.1, 

qqq 

Now we let Lq and Lb be those lines having respectively at least y^q-'V 5 green or 
blue incidences with joints in Jr. We let J' denote that set of joints which has red, 
green, and blue incidences with lines in Lq and Lb and by lemma 4.1, it is easy to 
show that 

qq 

I J' I > — \J\. 
I — 100 

Our goal now is to produce a polynomial of relatively low degree vanishing on all 
the points of J'. (Any degree which is substantially lower than N? will suffice. We 
say a line I of Lq or Lb meets a line I' of Lr if I fl I' is a joint of Jr. Each line of 
Lq and each line of Lb meets at least lines of Lr. We now take a random 

subset L' R of the lines of Lr, picking each line with probability j^. 
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N2 



By the law of large numbers, with positive probability, the following events occur: 
Each line of Lq and Lb meets at least 2^50 -/V 2 ^ nes °^ ^'r anc * 

w <- ¥■ 

We make a set of points S by selecting points on each line of L' R . Then 

\s\< K . 

We find a polynomial p which vanishes on all points of S. By the estimate on the 

i 

size of S, we may select p with degree d which is < ^4-. With K sufficiently large, 

this means that p must vanish on each line of L' R and because of the number of lines 
of L' R that each line of Lq and Lb meet, it means that p must vanish identically 
on each line of Lq and Lb- Therefore, the polynomial p must vanish on the entire 
set J'. 

Now, it is not necessarily the case that p is irreducible. Thus we factor p into 
irreducibles 

m 

We denote the degree of the polynomial pj by dj and observe that 
We let Jj be the subset of J' on which pj vanishes, and we have 

m 



Thus by lemma 4.2, we find j for which 

\J 3 \>K^Nd 3 . 
^From now on, we restrict our attention to Jj. 

We denote by Lr.JjLgj, and Lbj those lines in Lr, Lb, and Lq which are incident 
to at least dj + 1 elements of Jj , and we let Jj be those element of Jj incident to a 
line each from LRj,Lc,j, and Lbj- With K sufficiently large, we have 

qqq 

We now define L' R }1 L' G ., and L' B ■ as the set of lines which are incident to more 
than dj + 1 points of Jj. We define J" to be the set of joints defined by these lines. 
We have 
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We now break into two cases. In the first case, there are fewer than d 2 lines in each 
of L' R j,L' G j, and L' B -. In this case, we start over again, having a joints problem 
with fewer lines and more favorable exponents than the original. 

In the second case, we may assume without loss of generality that L' R ■ contains at 
least d 2 lines. By the definition of Lrj, Lq ^, and Ls.j, the polynomial pj vanishes 
identically on each line in these sets. However, this implies that each point of Jj is 
a critical point of pj , because otherwise it would be impossible for pj to vanish on 
three, intersecting, non-coplanar lines. But this implies that each component of the 
gradient of pj vanishes at each point of Jj . Let q be one of the components of the 
gradient which does not vanish identically. Then q has degree at most dj — 1. Thus, 
it must vanish on every line of L' R . But this is a contradiction by proposition 3.1. 



6. Bourgain's Incidence Problem 



In this section, we prove theorem 1.2. We suppose we have a set of points X C K 3 
of cardinality , with K large to be specified later and a set L of N 2 lines so that 
no N lines lie in the same plane and so that each line I £ L is incident to at least N 
points of X. We may assume in what follows that each line is incident to exactly 
iV points by coloring N of its incidences black and only counting black incidences 
below. 

We say that a point x e X is valuable if it is incident to at least lines. We 
define v(x) the value of x to be the number of lines it is incident to. We let X v be 
the set of valuable points. Clearly 

\x v \ < 1000N3 



and by lemma 4.1 



v(x) > 



K ' 
999iV 3 



^ w " 1000 

xex v 



We now perform some dyadic pigeonholing to uniformizc the quantity v(x). We 
define X 3 - to be the set of those x e X v so that 



We define 



Then note that 



Note also that 



< v(x) < . 

1000 " v ; 1000 



xeXj 



j=i xeXv 



999N 3 
1000 ' 



i=i J 
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Hence, by the pigeonhole principle, we can fix a positive number j so that 

> 999iV 3 
j ~ 2000j 2 ' 

(In the argument below, the reader is encouraged to assume that j = 1, since this 
is indeed the worst case.) 

^From this we see that 

N 3 N 3 
>\XA > 



Next we find a polynomial p which vanishes on every point of Xj . We may choose 
p to have degree d satisfying 

N 



d< 



The polynomial p need not be irreducible. Thus we factor it 

P = PlP2---Pm, 

with pk having degree dk- We let Xj t k be the set of points of Xj on which pk 
vanishes. Clearly, we have 

N 

di +d 2 H h d m < — 

if 3 23 

while 

+ \x jt 2\ + ■■■ + \X jtm \ > \Xj\ > 

Thus by the pigeonhole principle, we can fix a k with 

We let y = Xj.fe and by the definition of Xj, if / is the number of incidences 
between L and Y, we have 

/ > N 2 d k Khij- 2 » N 2 d k . 

We let L' be the set of lines incident to more than 100(4 points of Y and let I' be 
the number of incidences between L' and Y. Then clearly 

r>i. 

Note that each line of L' is in the zero set of p k ■ Now let Y' be the set of points 
of Y incident to more than 3 lines of L'. Then each point of Y' is cither a critical 
point of pk = or else by lemma 3.3, it must be a flat point of pk- 

We let L" be the set of lines in L' incident to at least lOdfe points of of Y'. If I" is 
the number of incidences between lines of L" and points of Y', we still have 

I" >N 2 d k Khij- 2 . 

We let I fiat and I C rit be the number of those incidences with flat points and critical 
points respectively. Note that 

Icrit + I flat > 
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There are two cases. In the first case 

I cnt >N 2 d k Khij- 2 , 

which means that there are at least NdkK^2i j~ 2 >> d\ lines in the surface pk 
on which are critical. But this is a contradiction in light of proposition 3.1. 



In the second case 

I fiat > N 2 d k Khij- 2 , 

which means that there are at least Nd k K^2i j~ 2 >> 3d 2 flat lines in the surface 
Pk = 0. In light of corollary 3.4 the surface p k = is in fact a plane. But now we 
have more than N lines of L lying in a plane which is also a contradiction. 
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